How surprising is a simple pattern? Quantifying "Eureka!".
نویسنده
چکیده
Simple patterns are compelling. When all the observed facts fit into a simple theory or "story," we are intuitively convinced that the pattern must be real rather than random. But how surprising is a simple pattern, really? That is, given a pattern of featural data, such as the properties of a set of objects, how unlikely would the pattern be if they were actually generated at random? In conventional statistics dealing with patterns of numbers, this type of question would be answered by reference to a null distribution such as the t distribution. This paper gives the analogous answer in the realm of concept learning, that is, the formation of generalizations from patterns of featural data. Using a formal but psychologically valid definition of complexity, I derive and exhibit the distribution of subjective complexity under the hypothesis of no pattern. This leads directly to a number of applications, including a statistical test indicating whether an observed pattern is sufficiently simple that it is not likely to have been an accident: literally, the "significance of simplicity."
منابع مشابه
Resolving Redundancy: A Recurring Problem in a Lessons Learned System
The value of a lessons learned collection depends on how readily the right lesson can be retrieved at the right time. In this paper, we discuss one such collection, from the Eureka system for exchanging tips on photocopier repair. Feedback from Xerox photocopier repair technicians using Eureka indicates that redundancy in the knowledge base is a significant impediment to effective use of the co...
متن کاملQuantifying Isolation Anomalies
Choosing a weak isolation level such as Read Committed is understood as a trade-off, where less isolation means that higher performance is gained but there is an increased possibility that data integrity will be lost. Previously, one side of this trade-off has been carefully studied quantitatively – there are well-known metrics for performance such as transactions per minute, standardized bench...
متن کاملBayesian surprise attracts human attention
We propose a formal Bayesian definition of surprise to capture subjective aspects of sensory information. Surprise measures how data affects an observer, in terms of differences between posterior and prior beliefs about the world. Only data observations which substantially affect the observer's beliefs yield surprise, irrespectively of how rare or informative in Shannon's sense these observatio...
متن کاملBetween order and chaos
What is a pattern? How dowe come to recognize patterns never seen before? Quantifying the notion of pattern and formalizing the process of pattern discovery go right to the heart of physical science. Over the past few decades physics’ view of nature’s lack of structure—its unpredictability—underwent a major renovation with the discovery of deterministic chaos, overthrowing two centuries of Lapl...
متن کاملComparison of Two Spectrophotometric Methods for Quantifying Total Hydroxycinnamic Acids in Coneflower (Echinacea purpurea) Preparations
Background & Aim: Hydroxycinnamic acids are one of the most important bioactive substances of Echinacea drugs. These compounds possess immuno-enhancing activity and thus, total hydroxycinnamic acids are mostly used as the main criterion for quality control of Echinacea purpurea and its drugs. Hence, the quality control of Echinacea requires to develo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Cognition
دوره 93 3 شماره
صفحات -
تاریخ انتشار 2004